The Astronomy Commons Platform: A Deployable Cloud-based Analysis Platform for Astronomy
نویسندگان
چکیده
Abstract We present a scalable, cloud-based science platform solution designed to enable next-to-the-data analyses of terabyte-scale astronomical tabular data sets. The presented is built on Amazon Web Services (over Kubernetes and S3 abstraction layers), utilizes Apache Spark the Astronomy eXtensions for parallel analysis manipulation, provides familiar JupyterHub web-accessible front end user access. outline architecture platform, provide implementation details rationale (and against) technology choices, verify scalability through strong weak scaling tests, demonstrate usability an example from Zwicky Transient Facility’s 1Bn+ light-curve catalog. Furthermore, we show how this system enables iteratively build (in Python) that transparently scale processing with no need end-user interaction. be deployable by astronomers moderate cloud engineering knowledge, or (ideally) IT groups. Over past 3 yr, it has been utilized platforms DiRAC Institute, ZTF partnership, LSST Solar System Science Collaboration, Interdisciplinary Network Collaboration Computing, as well numerous short-term events (with over 100 simultaneous users). In live demo instance, deployment scripts, source code, cost calculators are accessible. 4 http://hub.astronomycommons.org/
منابع مشابه
The Modern Cloud-Based Platform
ing and Library Use: Abstracting is permitted with credit to the source. Libraries are permitted to photocopy for private use of patrons, provided the per-copy fee indicated in the code at the bottom of the rst page is paid through the Copyright Clearance Center, 222 Rosewood Drive, Danvers, MA 01923. NEXT ISSUE:
متن کاملASTROMLSKIT: A New Statistical Machine Learning Toolkit: A Platform for Data Analytics in Astronomy
Astroinformatics is a new impact area in the world of astronomy, occasionally called the final frontier, where several astrophysicists, statisticians and computer scientists work together to tackle various data intensive astronomical problems. Exponential growth in the data volume and increased complexity of the data augments difficult questions to the existing challenges. Classical problems in...
متن کاملSage Bionetworks Commons and Platform
Perspective: A Need for Better Maps of Disease The revolution in health care that was anticipated by the sequencing of the human genome has failed to materialize 1. The failure rate for drugs in clinical development is still startlingly high despite unprecedented investment in R&D that reached a record $65 billion in 2009. This is largely due to the very high attrition rate for compounds in cli...
متن کاملBiologic Data Analysis Platform Based on the Cloud
To improve the research productivity in bioinformatics study by using effective means of large scale data analysis, there are many obstacles that need to be overcome They are standardization of data collection and analysis, management of computing and storage resources, easiness of parallel programming, and efficiency of data analysis job execution, to name a few. Among these, easiness of paral...
متن کاملAstronomy in the Cloud: Using MapReduce for Image Coaddition
In the coming decade, astronomical surveys of the sky will generate tens of terabytes of images and detect hundreds of millions of sources every night. The study of these sources will involve computation challenges such as anomaly detection and classification and moving-object tracking. Since such studies benefit from the highest-quality data, methods such as image co-addition, i.e., astrometri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Astronomical Journal
سال: 2022
ISSN: ['1538-3881', '0004-6256']
DOI: https://doi.org/10.3847/1538-3881/ac77fb